WORLD INTELLECTUAL PROPERTY ORGANIZATION 

International Bureau 



PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6: C12N 15/10, C12Q 1/68 // C12N 9/00

(11) International Publication Number: WO 98/05765 A1

(43) International Publication Date: 12 February 1998 (12.02.98)

(21) International Application Number: PCT/DK97/00317

(22) International Filing Date: 23 July 1997 (23.07.97)

(30) Priority Data: 8/208422, 7 August 1996 (07.08.96), JP



(71) Applicant (for all designated States except US): NOVO
NORDISK A/S [DK/DK]; Novo Allé, DK-2880 Bagsvaerd (DK).

(72) Inventors; and 

(75) Inventors/Applicants (for US only): MIYOTA, Yoshiaki 
[JP/JP]; Showa Denko K.K., Central Research Laboratory, 
1-1, Ohnodai 1-chome, Midori-ku, Chiba-shi, Chiba 267 
(JP). FUKUYAMA, Shiro [JP/JP]; Showa Denko K.K., 
Central Research Laboratory, 1-1, Ohnodai 1-chome, 
Midori-ku, Chiba-shi, Chiba 267 (JP). 

(74) Common Representative: NOVO NORDISK A/S; Corporate
Patents, Novo Allé, DK-2880 Bagsvaerd (DK).



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR,
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE,
HU, IL, IS, KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT,
LU, LV, MD, MG, MK, MN, MW, MX, NO, NZ, PL, PT,
RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, UA,
UG, US, UZ, VN, ZW, ARIPO patent (GH, KE, LS, MW,
SD, SZ, UG, ZW), Eurasian patent (AM, AZ, BY, KG, KZ,
MD, RU, TJ, TM), European patent (AT, BE, CH, DE, DK,
ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI
patent (BF, BJ, CF, CG, CI, CM, GA, GN, ML, MR, NE,
SN, TD, TG).



Published 

With international search report 
Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: DOUBLE-STRANDED DNA WITH COHESIVE END(S), AND METHOD OF SHUFFLING DNA USING THE SAME 
(57) Abstract 

To provide a method of mutation of DNAs, which is substantially different from the conventional methods applicable to naturally- 
existing DNAs, and also to provide useful genetic products to be produced by the use of thus-mutated DNAs. A DNA with a cohesive end 
comprising (a) a double-stranded DNA having the same sequence as that of a part of a gene, and (b) a single-stranded DNA having a base 
sequence that exists on said gene at the site not adjoining the part corresponding to said double-stranded DNA or a base sequence which 
said gene does not have, wherein the single-stranded DNA is linked to either one end of the double-stranded DNA to form a cohesive end; 
a method for producing it; a method of shuffling a DNA using it; a DNA and a DNA pool to be obtained by the shuffling method; a method 
for producing the DNA pool; and a genetic product to be obtained by expressing the genetic information existing in the DNA pool. 







Title: Double-stranded DNA with cohesive end(s), and method of 
shuffling DNA using the same 

INDUSTRIAL FIELD

The present invention relates to a double-stranded DNA with a
cohesive end or cohesive ends having a desired sequence and a
method for producing it, and also to a method for shuffling a DNA
using DNA blocks with a cohesive end or cohesive ends, the DNA
shuffled according to the method, a DNA pool obtained according to
the shuffling method, and a genetic product produced by the use of
the DNA pool.

BACKGROUND ART

One approach to protein engineering for improving naturally-existing
proteins into modified ones that are more useful to human beings is
to improve proteins through site-specific mutation, which has
produced some results (Japanese Patent Application Laid-open No.
5-91876). However, this requires the clarification or identification
of the stereostructure of the targeted protein, and much labor is
needed for the analysis of the stereostructure. In addition, even
when the stereostructure has been clarified or identified, many
matters remain unknown about the relationship between the structure
and the function of proteins. Therefore, it is still difficult to
reliably impart an intended function to the targeted protein.

In order to overcome these difficulties, a process comprising random
mutation and screening, and also evolutional molecular engineering
that utilizes the evolution of organisms, have attracted attention
and are said to be extremely useful (Proc. Natl. Acad. Sci., USA,
83, 576 (1986)). However, the current methods are directed to the
substitution of at most several amino acids.

WO95/22625 discloses a method for forming novel genes by dividing a
plurality of genes at random and homologously recombining them to
reconstruct novel genes. However, this is a method for forming
chimera genes. The genes formed by this method are similar to the
original genes and retain the essential base sequences of the
latter.

Using such known methods, it is difficult to expect the impartation
to organisms of additional functions which they could not gain
during the steps of their evolution. In order to obtain genetic
products whose functions are greatly different from those of
naturally-existing substances such as proteins, it is believed
effective to prepare a pool of nucleic acids occupying base sequence
spaces significantly different from those existing naturally, and to
produce from them genetic products having the intended functions.

One method for this may be to prepare a nucleic acid pool that
covers all base combinations. However, even the total number of the
base sequences that may code for a relatively small protein with 100
amino acids (300 bp) is an enormous number of 4^300 (about 10^180),
and it is in fact impossible to prepare a nucleic acid pool that
covers all of them.
For some kinds of proteins, sub-structures referred to as modules
have been specifically noted, and an attempt was made to change the
order of the base sequence blocks corresponding to the individual
modules to thereby produce mutants having different module sequences
(Viva Origino, Vol. 23, No. 1 (1995) 86-87). In this attempt,
however, the base sequences were merely re-sequenced individually
for the individual mutants. No one has heretofore attempted the
formation of a nucleic acid pool covering all re-sequenced molecules
and the collection, from the pool, of genes capable of expressing
products having intended properties.

Utilizing restriction enzymes, it is possible to prepare a nucleic
acid pool including various molecules by blending several kinds of
DNA blocks having the same cohesive end or blunt end and then
ligating them at random, and to select from this pool some molecules
having desired properties. According to this method, however, the
DNAs to be used must have predetermined restriction enzyme
recognizing sites. Even if the DNAs have such restriction enzyme
recognizing sites, there is an extremely small probability that the
sites are positioned at the desired sites. In this method, in
addition, both ends of the blocks must be of the same type, and
there is therefore a high probability that the blocks are
self-ligated. A means of forming the restriction enzyme recognizing
sites through site-specific mutation may be taken in order to
overcome these problems. However, whether or not the blocks can be
ligated in accordance with the predetermined frame is then largely
governed by chance. That is, whether or not a desired protein can be
produced without shifting the reading frame of the codons is
governed by chance. Therefore, the method is extremely inefficient.

The subject matter of the present invention is to provide a method
for efficiently obtaining base sequences that exist in spaces
greatly different from those of naturally-existing base sequences,
and also to provide genetic products to be obtained by expressing,
as genes, the nucleic acid sequences that are obtained in that
manner and that do not exist naturally.

The sequence space of a gene includes every full-length sequence
that can theoretically be constituted by a combination of the four
bases, A, G, C and T. For example, a base sequence that codes for a
protein composed of a number "n" of amino acids is constructed by
selecting any desired one of the four bases a total of 3n times, and
the space therefore includes 4^(3n) combinations. Accordingly, a
protein composed of 100 amino acids corresponds to about 10^180
different base sequences, as mentioned hereinabove.
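The count stated above can be checked numerically. The following is a minimal sketch in Python; the function names are illustrative only and are not part of the original disclosure.

```python
import math


def sequence_space_size(n_amino_acids: int) -> int:
    """Number of distinct base sequences of length 3n over the bases A, G, C, T."""
    return 4 ** (3 * n_amino_acids)


def sequence_space_log10(n_amino_acids: int) -> float:
    """Base-10 logarithm of the sequence-space size (avoids building the big integer)."""
    return 3 * n_amino_acids * math.log10(4)


# For a protein of 100 amino acids the exponent is roughly 180.6,
# which agrees with the "about 10^180" figure given in the text.
print(sequence_space_log10(100))
```

The logarithmic form is convenient because the exact integer for even a small protein has well over a hundred digits.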

In fact, there is no limitation on the number of amino acids that
constitute proteins. Therefore, the sequence spaces for proteins
extend without limit. During the steps of evolution of organisms,
only a part of such sequence spaces has been examined, and there is
a great probability that sequences coding for proteins with
extremely excellent functions exist in the remaining, vast sequence
spaces. The protein engineering studies which have been and are
being made in many laboratories and institutes at present are
essentially directed to the creation of novel proteins having
functions superior to those of naturally-existing proteins, and one
essential approach taken for this purpose is to substitute amino
acids in existing sequences, as mentioned hereinabove. However,
amino acid substitution is nothing but the essential means that
organisms have themselves used during the steps of their evolution.
That is, it is an imitation of organisms and searches only around
the sequences that organisms have already examined. In addition,
there is a probability that the sequences thus obtained will be
those that were already weeded out in the past.

We, the present inventors, have considered that, in order to move
far away from the sequence spaces that organisms have already
examined, the purpose will be attained if we carry out operations
that could not have been carried out by organisms. We know that the
division of a gene into several blocks followed by a change in the
order of the thus-divided blocks, if it occurred in organisms, would
kill the organisms. Therefore, we have concluded that this method is
suitable for our purpose. Having thus concluded, we, the present
inventors, have assiduously studied various matters relating to this
method and, as a result, have found a method of forming a desired
cohesive end or ends on a desired DNA. Utilizing this method, we
have succeeded in establishing a method of dividing a gene into
several blocks and re-sequencing them into different sequences, and
also a method of producing a molecule pool including such different
base sequences existing in different sequence spaces, and thus have
completed the present invention.

Accordingly, the present invention provides the following:

1) A DNA with a cohesive end comprising (a) a double-stranded DNA
having the same sequence as that of a part of a gene, and (b) a
single-stranded DNA having a base sequence that exists on said gene
at the site not adjoining the part corresponding to said
double-stranded DNA or a base sequence which said gene does not
have, wherein the single-stranded DNA is linked to either one end of
the double-stranded DNA to form a cohesive end.

2) A DNA with cohesive ends comprising (a) a double-stranded DNA
having the same sequence as that of a part of a gene, (b) a first,
single-stranded DNA having a base sequence that exists on said gene
at the site not adjoining the part corresponding to said
double-stranded DNA or a base sequence which said gene does not
have, and (c) a second, single-stranded DNA having a base sequence
that exists on said gene at the site adjoining the part
corresponding to said double-stranded DNA, wherein the second,
single-stranded DNA is linked to said double-stranded DNA at one end
corresponding to said adjoining site, while the first,
single-stranded DNA is linked thereto at the other end of the
complementary strand opposite to said end, thereby forming cohesive
ends.

3) The DNA with a cohesive end or cohesive ends according to the
previous 1) or 2), wherein the single-stranded DNA has a length of 2
bases or more.

4) The DNA with a cohesive end or cohesive ends according to any one
of the previous 1) to 3), wherein the cohesive end/ends is/are
positioned at the 3'-terminal/terminals.

5) A method for producing a DNA with a cohesive end or cohesive
ends, wherein a part of a DNA, as a template, and an oligonucleotide
containing at least one ribonucleotide, as a primer, are subjected
to DNA polymerase reaction to prepare a double-stranded DNA, then
the ribonucleotide(s) is/are removed through enzymatic reaction or
chemical reaction, and the nucleotide(s) remaining at the
5'-terminal(s) of the site(s) at which said ribonucleotide(s)
existed are removed.

6) A method for producing the DNA with a cohesive end of the
previous 1), comprising the following steps a) to d):

a) a step of linking (i) an oligonucleotide having the same base
sequence as that of a part of a gene DNA to (ii) an oligonucleotide
having a base sequence that exists on the gene at the site not
adjoining the base sequence of (i) or a base sequence which the gene
does not have, and containing at least one ribonucleotide, in such a
manner that the oligonucleotide (ii) is positioned at the
5'-terminal of the oligonucleotide (i);

b) a step of preparing a double-stranded DNA through DNA polymerase
reaction between a DNA containing the part corresponding to the
oligonucleotide (i) in said a), as a template, and the linked
oligonucleotide as obtained in the previous step a), as a primer;

c) a step of removing the ribonucleotide from said double-stranded
DNA through enzymatic reaction or chemical reaction; and

d) a step of removing the nucleotide remaining at the 5'-terminal of
the site at which said ribonucleotide existed.

7) A method for producing the DNA with cohesive ends of the previous
2), comprising the following steps a) to d):

a) a step of linking (i) an oligonucleotide having the same base
sequence as that of a part of a gene DNA to (ii) an oligonucleotide
having a base sequence that exists on the gene at the site not
adjoining the base sequence of (i) or a base sequence which the gene
does not have, and containing at least one ribonucleotide, in such a
manner that the oligonucleotide (ii) is positioned at the
5'-terminal of the oligonucleotide (i);

b) a step of preparing a double-stranded DNA through DNA polymerase
reaction between a DNA containing the part corresponding to the
oligonucleotide (i) in said a), as a template, and (i) the linked
oligonucleotide as obtained in the previous step a) and (ii) an
oligonucleotide which is a complementary strand of an
oligonucleotide existing on the gene at the site separated from said
oligonucleotide-corresponding part by at least 3 bases toward the
3'-terminal and which contains at least one ribonucleotide, as
primers;

c) a step of removing the ribonucleotides from said double-stranded
DNA through enzymatic reaction or chemical reaction; and

d) a step of removing the nucleotides remaining at the 5'-terminals
of the sites at which said ribonucleotides existed.

8) A method for shuffling a DNA, comprising dividing a DNA into a
plurality of DNA blocks each having a cohesive end or cohesive ends,
followed by ligating them together into a sequence that is different
from the sequence of the original, non-divided DNA.

9) A method for shuffling a DNA, comprising applying the method of
any one of the previous 5) to 7) to various sites of a DNA, thereby
dividing the DNA into a plurality of DNA blocks each having a
cohesive end or cohesive ends, at least one block of which shall
have a cohesive end that is complementary to the cohesive end of
another block not having been directly adjacent to said one block on
the original DNA, followed by ligating them together into a sequence
that is different from the sequence of the original, non-divided
DNA.

10) The shuffling method according to the previous 8) or 9), wherein
the DNA is divided into 3 or more blocks.

11) The shuffling method according to any one of the previous 8) to
10), wherein the blocks are ligated together using a DNA ligase.

12) A DNA as shuffled according to the method of any one of the
previous 8) to 11).

13) The DNA according to the previous 12), wherein a gene coding for
an enzymatic function or a control gene for the gene is shuffled.

14) The DNA according to the previous 13), wherein the gene is a
gene that codes for any one of proteases, lipases, cellulases,
amylases, catalases, xylanases, oxidases, dehydrogenases, oxygenases
and reductases.

15) The DNA according to the previous 13) or 14), wherein the gene
is one derived from prokaryotes.

16) The DNA according to the previous 15), wherein the gene is one
derived from Bacillus bacteria.

17) The DNA according to the previous 16), wherein the gene is a
protease API21 gene.

18) A DNA pool containing plural kinds of DNAs having different
structures that are obtained according to the shuffling method of
any one of the previous 8) to 11).

19) The DNA pool according to the previous 18), which contains 10 or
more kinds of DNAs.

20) A method for producing a DNA pool, comprising applying the
method of any one of the previous 5) to 7) to various sites of a
template DNA to thereby prepare a mixture of DNA blocks each having
a cohesive end or cohesive ends that satisfies the following
conditions, followed by ligating these into any desired sequences:

Condition 1: Each block has a double-stranded site having the same
sequence as that of a part of the template DNA.

Condition 2: At least two of the blocks that constitute the block
mixture further have, in addition to said double-stranded site, a
single-stranded site (cohesive end) that is complementary to the
cohesive end of blocks that are not directly adjacent to said blocks
on the template DNA.

Condition 3: The block mixture contains at least two different
blocks which are the same in the double-stranded site but are
different only in the single-stranded site and which satisfy the
condition 2.
21) The method for producing a DNA pool according to the previous
20), wherein the template DNA is a gene that codes for an enzymatic
function or a control gene DNA for the gene.

22) The method for producing a DNA pool according to the previous
21), wherein the template DNA is a gene DNA that codes for any one
of proteases, lipases, cellulases, amylases, catalases, xylanases,
oxidases, dehydrogenases, oxygenases and reductases.

23) The method for producing a DNA pool according to the previous
22), wherein the template DNA is one derived from prokaryotes.

24) The method for producing a DNA pool according to the previous
23), wherein the template DNA is one derived from Bacillus bacteria.

25) The method for producing a DNA pool according to the previous
24), wherein the template DNA is a protease API21 gene.

26) The method for producing a DNA pool according to any one of the
previous 20) to 25), wherein the DNA blocks are ligated together
using a DNA ligase.

27) A genetic product to be obtained by expressing the genetic
information on DNA molecules that exist in the DNA pool of any one
of the previous 18) to 26).
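The shuffling of items 8) to 11) and the block-mixture conditions of item 20) can be illustrated with a small model. The following Python sketch is only an illustration under stated assumptions: the `Block` class, the function names, the 10-base overhang sequences, and the rule that a finished molecule has blunt outer ends are all hypothetical and are not taken from the original disclosure.

```python
from dataclasses import dataclass
from itertools import permutations
from typing import List, Optional, Set, Tuple

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


@dataclass(frozen=True)
class Block:
    """Hypothetical model of a DNA block: a named duplex plus optional overhangs."""
    name: str
    left: Optional[str]   # cohesive end at the left end, written 5'->3'; None = blunt
    right: Optional[str]  # cohesive end at the right end, written 5'->3'; None = blunt


def can_ligate(upstream: Block, downstream: Block) -> bool:
    """Two blocks can be joined when their facing cohesive ends are complementary."""
    if upstream.right is None or downstream.left is None:
        return False
    return reverse_complement(upstream.right) == downstream.left


def assemblies(pool: List[Block], n_blocks: int) -> Set[Tuple[str, ...]]:
    """Enumerate block orders in which every junction anneals (assumed blunt outer ends)."""
    found = set()
    for order in permutations(pool, n_blocks):
        names = tuple(block.name for block in order)
        if len(set(names)) != n_blocks:
            continue  # assumption: each duplex region appears once per molecule
        if order[0].left is not None or order[-1].right is not None:
            continue
        if all(can_ligate(a, b) for a, b in zip(order, order[1:])):
            found.add(names)
    return found


# Made-up overhangs; the original gene order is A-B-C.
E1, E2, E3, E4 = "GATTACAGGC", "CCTGAAGTCA", "TTGACCGTAA", "AGCATCGGTT"
pool = [
    Block("A", None, E1),
    Block("C", reverse_complement(E1), E2),
    Block("B", reverse_complement(E2), None),
    Block("B", None, E3),
    Block("A", reverse_complement(E3), E4),
    Block("C", reverse_complement(E4), None),
]
print(sorted(assemblies(pool, 3)))
```

In this toy pool, each duplex region occurs in two variants that differ only in their single-stranded sites, and every cohesive end pairs with a block that was not its neighbor on the original A-B-C order, so the enumerated molecules all differ from the original arrangement.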
Now the present invention is described in detail hereinunder.

DNA with Cohesive End(s)

The present invention provides a DNA with any desired cohesive end
or ends (herein referred to as "DNA with cohesive end(s)" unless
otherwise specifically indicated). The cohesive end as referred to
herein indicates a single-stranded site protruding from the end of a
double-stranded DNA. Such a cohesive end may be formed when a DNA is
cleaved with a restriction enzyme such as EcoRI. In this case,
however, the base sequence of the thus-formed cohesive end is
determined by the restriction enzyme used, and its length is
generally several bases or so. If a naturally-existing DNA is
cleaved with a restriction enzyme, the sequence of the resulting
double-stranded part of the DNA is also limited to the region
sandwiched between the restriction enzyme recognizing sites. As
opposed to this, the DNA with cohesive end(s) of the present
invention may have a structure in which a cohesive end or cohesive
ends having a desired length and a desired sequence is/are added to
the end or ends of a double-stranded DNA having a desired sequence.

As has been mentioned hereinabove, the sequence of the
double-stranded part of the DNA with cohesive end(s) of the present
invention is not specifically defined. For example, the sequence may
be the same as that of a part of a gene. Though also not
specifically defined, its length may be generally 50 base pairs (bp)
or more, preferably 45 bp or more. The sequence of the cohesive end
is also not specifically defined, but in order to prevent its
self-ligation in various reactions, it is preferable that the
sequence does not form a stem structure. The "sequence to form a
stem structure" as referred to herein includes, for example, AATT,
which has just the same sequence as that of its complementary strand
(TTAA). The length of the cohesive end may be generally 2 bp or
more, preferably from 15 bp to 30 bp. If the cohesive end is too
long, it may form a secondary structure, whereby the intermolecular
annealing will be difficult. However, if it is too short, its
melting temperature (Tm) is low and the annealing will be unstable.
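The design considerations above (avoiding self-complementary sequences such as AATT, and balancing length against melting temperature) can be screened with a few lines of code. The following Python sketch is an illustration only. The function names are hypothetical, and the melting temperature uses the Wallace rule of thumb, Tm = 2(A+T) + 4(G+C) in degrees Celsius, which is a rough estimate for short oligonucleotides and is an assumption introduced here, not a formula given in this document.

```python
from typing import List

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def is_self_complementary(seq: str) -> bool:
    """True when the sequence equals its own reverse complement (e.g. AATT)."""
    return seq.upper() == reverse_complement(seq)


def wallace_tm(seq: str) -> int:
    """Rough Tm estimate in degrees Celsius (Wallace rule; assumption, see lead-in)."""
    s = seq.upper()
    return 2 * (s.count("A") + s.count("T")) + 4 * (s.count("G") + s.count("C"))


def cohesive_end_warnings(seq: str, preferred_min: int = 15,
                          preferred_max: int = 30) -> List[str]:
    """Collect design warnings for a candidate cohesive-end sequence."""
    warnings = []
    if len(seq) < 2:
        warnings.append("shorter than 2 bases")
    if not preferred_min <= len(seq) <= preferred_max:
        warnings.append("outside the preferred 15 to 30 base range")
    if is_self_complementary(seq):
        warnings.append("self-complementary, may self-ligate")
    return warnings


print(cohesive_end_warnings("AATT"))
print(wallace_tm("GATTACAGGCTTAGC"))
```

Full self-complementarity is only the simplest case of a stem-forming sequence; a practical screen would also look for internal hairpins, which this sketch does not attempt.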

The cohesive end may be linked to either the 3'-terminal or the
5'-terminal of the double-stranded DNA, but is preferably linked to
the 3'-terminal thereof. The cohesive end may be linked to either
only one terminal of the double-stranded DNA or both terminals
thereof.

Method for Producing DNA with Cohesive End(s) 

The DNA with cohesive end(s) of the present invention can be
produced typically according to a method comprising the following
steps a) to d). The method mentioned below is directed to the
production of a DNA with a cohesive end, which has a structure
represented by formula (2):

(2)

wherein X and c are desired sequences; a and S are
sequences that are complementary to X and c, 
respectively; and is a sequence of a cohesive end, 
and which is based on a double-stranded DNA (template DNA) 
5 having a structure to be represented by a formula (1): 

(1) 

wherein X, c, a f fi, and have the same meanings as 

above . 

However, the present invention is not limited to only the production
illustrated herein, and other DNAs with cohesive end(s) having other
structures can also be produced in the same manner as below
according to the present invention.

Step a)

(a-1) Preparation of Oligonucleotide:

First, the part that is to be selected as the double-stranded part
of the intended DNA with a cohesive end is defined on a template
DNA. An oligonucleotide, a, which is complementary to its terminal,
X, and an oligonucleotide, c, having the same sequence as that of
the other terminal, c, are prepared. X and c each may have a
sequence having a base length of from 15 to 30 bp or so.

On the other hand, prepared is an oligonucleotide, b, which is
complementary to the sequence obtained by removing one base (this is
referred to as X) from the 5'-terminal of the sequence of the
intended cohesive end. The base sequence of the cohesive end may be
a part of the above-mentioned DNA or may be any other sequence that
the DNA does not have.

These oligonucleotides, a, b and c, may be prepared by any method.
If their sequences are previously known, they can be synthesized
using a known DNA synthesizer.

(a-2) Preparation of Ribonucleotide-Containing Fragments:

Next, the oligonucleotides, a and b, are linked together via a
ribonucleotide. This linkage can be attained by ordinary
synthesizing methods. For example, it can be attained according to
the process mentioned below.

First, a phosphoryl group is added to the 5'-terminal of the
oligonucleotide, a, according to the reaction of the following
formula (3):

(3)

wherein (P) is a phosphoryl group.

This reaction can be effected by the action of a polynucleotide
kinase. ATP is used in an amount of from 2 to 10 times or so, by
mol, relative to the oligonucleotide, a. The reaction temperature
may be from 30 to 40°C or so. The reaction time may be from 10
minutes to 1 hour or so. Most suitably, the pH is from 7 to 9 or so.
After the addition of the phosphoryl group thereto, the
oligonucleotide is represented by a'.

On the other hand, a ribonucleotide is added to the 3'-terminal of
the oligonucleotide, b, according to the reaction of the following
formula (4):

(4)

wherein X is any one of ATP, GTP, CTP and UTP; (rX) is a
ribonucleotide.

This reaction can be effected by the action of, for example, a
terminal deoxynucleotidyl transferase. For the nucleoside
triphosphate (XTP) to be used herein, a ribonucleotide that
corresponds to the base X in the previous step (a-1) is selected.
The nucleoside triphosphate is used in an amount of from 2 to 10
times, by mol, relative to the oligonucleotide, b. The reaction
temperature may be from 30 to 40°C or so. The reaction time may be
from 30 minutes to 2 hours or so. After the addition thereto, the
oligonucleotide is represented by b'. The sequence of b' is
complementary to the sequence of the intended cohesive end.

The thus-obtained oligonucleotides, a' and b', are mixed, whereby
the 5'-terminal (phosphoryl group) of a' is bonded to the
3'-terminal (hydroxyl group) of the ribonucleotide of b', according
to the reaction of the following formula (5):

(5)

This reaction can be effected by the action of an RNA ligase in the
presence of ATP and divalent metal ions (Japanese Patent Application
Laid-open No. 5-292967). Divalent metal ions useful in this reaction
include, for example, magnesium ions and manganese ions, but
preferred are magnesium ions. The RNA ligase is an enzyme that
catalyzes the ligation of the hydroxyl group at the 3'-terminal and
the phosphoryl group at the 5'-terminal, and it also efficiently
catalyzes the ligation of a polydeoxyribonucleotide having a
ribonucleotide only at its 3'-terminal and a polydeoxyribonucleotide
with a 5'-terminal phosphoryl group. Preferably used is a T4 RNA
ligase. The reaction is generally effected in a buffer, at a pH of
from 7 to 9 and at a temperature of from 10 to 40°C, over a period
of from 30 to 180 minutes. For example, the oligonucleotides may be
reacted in a solution comprising 50 mM Tris-HCl (pH 8.0), 20 mM
MgCl2, 0.1 mM ATP, 10 mg/liter BSA, 1 mM hexaammine cobalt chloride
(HCC) and 25% polyethylene glycol 6000, at 25°C for 60 minutes or
longer.

Step b)

Using the DNA containing the sequence, X, as prepared in the
previous step (a-1), as a template, and using the linked
oligonucleotide, b'-a', as prepared in the previous step (a-2), as a
primer, a double-stranded DNA is prepared through DNA polymerase
reaction. In general, a double-stranded DNA containing the sequence,
X, and a sequence, 3, on their strands is subjected to thermal or
alkaline denaturation to give single-stranded DNAs, to which the
primer b'-a' is added and which are subjected to PCR together with
the oligonucleotide, c, as prepared in the previous step (a-1). The
primer annealing condition and the polymerase reaction condition to
be employed herein may be the same as those in ordinary polymerase
reaction. The DNA polymerase to be employed herein may be any enzyme
that can catalyze the DNA chain-extending reaction, which includes,
for example, Taq polymerase, Klenow fragment, DNA polymerase I, etc.
As a result of this reaction, obtained is a double-stranded DNA with
blunt ends, which is represented by formula (6):

(6)

Step c)

Next, the ribonucleotide is removed from the double-stranded DNA
through enzymatic reaction or chemical reaction. One example of
useful enzymes is a ribonuclease. The reaction is generally effected
at a pH of from 6 to 8 or so, at from 30 to 70°C or so, over a
period of from 10 to 60 minutes or so. Non-enzymatic chemicals
usable herein include sodium hydroxide and the like. As a result of
this reaction, obtained is a partly-discontinuous, double-stranded
DNA of the following formula (7), in which the part corresponding to
the above-mentioned base, X, has been deleted.

(7)

Step d) 

After the above step, the nucleotide existing at the
5'-terminal of the above-mentioned deletion is removed. To remove
this nucleotide, for example, the double-stranded DNA from
which the ribonucleotide has been removed in the previous step
c) is heated at from about 50 to 90°C. The polynucleotide
separated from the strand through this reaction can be removed
using, for example, a spin column or the like. Thus the
double-stranded DNA with a cohesive end of the above-mentioned
formula (2) is obtained.

In the process mentioned above, a double-stranded DNA with a
cohesive end at only one of its 3'-terminals is obtained. In the
same manner, a double-stranded DNA with cohesive ends at both
3'-terminals is also obtainable.

In the above-mentioned process, a desired sequence, at,
which does not adjoin the sequence, X, in the template DNA was
introduced into the DNA to form the cohesive end. Apart from
this, it is also possible to introduce an additional
oligonucleotide that adjoins the sequence in the template DNA to
form another cohesive end. For example, in the embodiment
mentioned above, an oligonucleotide, c', which differs
from the oligonucleotide, c, in that its 3'-terminal
deoxyribonucleotide is substituted with a ribonucleotide, may
be used as the primer in place of the oligonucleotide, c, to
prepare a double-stranded DNA with two cohesive ends of a
formula (8):

(8).

Method of Shuffling DNA 

The present invention also provides a method of shuffling
a DNA, which is characterized by using DNAs with cohesive
end(s). The term "shuffling" as referred to herein indicates
the operation of dividing a DNA into plural blocks followed by
re-sequencing them into a desired, different sequence.

For example, where one DNA has a sequence composed of a
number, n, of blocks, as represented by a formula (9):

A - a1 - a2 - ... - an - B (9)

wherein the starting end A and/or the terminal end B may
be omitted,

this may be shuffled according to the present invention to give
a different DNA represented by a formula (10):

A - a1' - a2' - ... - ax' - B (10)

wherein a1', a2', ..., ax' are blocks that are
independently selected from the group of a1, a2, ...,
an; and the total number of the blocks a1', a2', ...,
ax' may not be the same as the total number of the blocks
a1, a2, ..., an.
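
The relation between formulas (9) and (10) can be illustrated with a short sketch. It is only an illustration under stated assumptions: the function name shuffled_sequences and the choice of three blocks are hypothetical, and the sketch simply lists every arrangement in which each position is independently selected from the original blocks while the ends A and B stay fixed.

```python
from itertools import product


def shuffled_sequences(blocks, length, start="A", end="B"):
    """List every arrangement of `length` blocks, each independently
    selected from `blocks`, flanked by the fixed ends `start` and `end`.

    Hypothetical helper for illustration; not part of the specification.
    """
    return [
        " - ".join([start, *choice, end])
        for choice in product(blocks, repeat=length)
    ]


blocks = ["a1", "a2", "a3"]

# Same number of blocks as the original (x = n = 3)
same_length = shuffled_sequences(blocks, 3)
print(len(same_length))
print(same_length[0])

# The number of blocks may also differ from n (here x = 2)
shorter = shuffled_sequences(blocks, 2)
print(len(shorter))
```

Because every position is chosen independently, the number of possible arrangements grows as the number of blocks raised to the power of the number of positions.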

The principle of the DNA shuffling of the present
invention, which utilizes DNAs with cohesive end(s), is
graphically illustrated in Fig. 1. In Fig. 1, the DNA is shuffled
at the intermediate part, p1 - p2 - p3 (the uppermost row), into
p3 - p1 - p2 (the lowermost row), without changing both
ends pa and pb. This shuffling operation is useful as a method
for obtaining gene sequences that have not heretofore existed
naturally, without changing the sequences of the promoter and
the terminator.

Concretely, the above-mentioned method of preparing DNAs
with cohesive end(s) is applied first to the parts pa, p1, p2,
p3 and pb constituting the template DNA, to thereby prepare DNA
blocks, a1, a2 and a3, each having the structure with two
cohesive ends (formula (8)), and DNA blocks, A and B, each
having the structure with one cohesive end (formula (2)). The
cohesive ends, aA, a1f, a2f and a3f, are formed by removing the
corresponding complementary strand from the blocks, pa, p1, p2
and p3, respectively.

The cohesive ends, a1r, a2r, a3r and aB, are designed
according to the desired sequence to be prepared after the
shuffling. In the embodiment of Fig. 1, the end, a1r, is
designed to be a complementary strand to the end, a3f, and the
block, a1, is ligated to the block, a3, after the shuffling. The
ligation is conducted using a DNA ligase in the presence of
ATP. The type of the DNA ligase to be employed herein is not
specifically defined. In this embodiment, since the
single-stranded part of each cohesive end is long, it is
unnecessary to employ the ordinary reaction at 16°C, and a
thermophilic DNA ligase is advantageously employed.

In the embodiment of Fig. 1, a2r, a3r and aB are designed
to be the complementary strands to a1f, aA and a2f,
respectively, in the same manner as above. As a result of the
shuffling, a sequence having a structure of A - a3 - a1 - a2 - B
is finally obtained. This is seemingly the same as the
re-sequenced order of pa - p3 - p1 - p2 - pb that would be
obtained by dividing the original DNA into the constitutive
blocks p1, p2, p3, pa and pb, followed by re-sequencing these
into a different sequence.
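
How the designed complementarity of the cohesive ends fixes the order of the blocks can be sketched as follows. This is a minimal model, not the experimental procedure: the end labels (aA, a1f, a1r, and so on) follow the Fig. 1 embodiment, while the dictionaries and the function assemble are hypothetical names introduced only for illustration.

```python
# End label -> block carrying it. The "f" ends and aA are the ends formed
# from the template; the "r" ends and aB are the designed ends.
DOWNSTREAM_END = {"aA": "A", "a1f": "a1", "a2f": "a2", "a3f": "a3"}
UPSTREAM_END = {"a1r": "a1", "a2r": "a2", "a3r": "a3", "aB": "B"}

# Design of the Fig. 1 embodiment: designed end -> end it is complementary to
COMPLEMENT = {"a1r": "a3f", "a2r": "a1f", "a3r": "aA", "aB": "a2f"}


def assemble(complement, start="A", end="B"):
    """Follow the complementary end pairs from `start` to `end`."""
    # successor[x] is the block ligated directly after block x
    successor = {
        DOWNSTREAM_END[down]: UPSTREAM_END[up]
        for up, down in complement.items()
    }
    order = [start]
    while order[-1] != end:
        nxt = successor.get(order[-1])
        if nxt is None or nxt in order:
            raise ValueError("design does not give a single linear product")
        order.append(nxt)
    return order


print(" - ".join(assemble(COMPLEMENT)))
```

With the pairings stated above, following the chain from block A gives the order A, a3, a1, a2, B, in agreement with the structure described for Fig. 1.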

Any other desired sequences can be realized in the same
manner as above. If the block, A or B, is made to have two
cohesive ends, while the other blocks are made to have one
cohesive end, it is possible to obtain still different sequences
through shuffling where the latter blocks with one cohesive end
are positioned at the terminals.

In the shuffling of the invention, it is also possible to
introduce foreign DNA block(s) with cohesive end(s), which are
not in the original gene, into the gene DNA. For example, it
is possible to shuffle two or more gene DNAs. In this case,
the terminal of one gene, for example, the block A in the
above-mentioned embodiment, may be processed into a DNA block
with two cohesive ends, if desired.

The blocks, which are the units to be shuffled, are
oligonucleotides or polynucleotides composed of 2 or more
nucleotides (hereinafter referred to as "oligonucleotides").
In general, these are preferably composed of 30 or more
nucleotide units, more preferably 45 or more nucleotide units.
The upper limit of the block length is not specifically
defined, provided that the block length is shorter than the
length of one gene. If, however, the block length is too
large, the re-sequenced DNA obtained by the shuffling
will have many non-mutated base sequence parts. Therefore, in
general, the block length is preferably within the range of
from 10 to 35 % of the length of a gene.

Where the gene to be shuffled is a gene that codes for a
protein, it is desirable that the gene blocks (oligonucleotides)
have the same reading frame before and after the division.
Namely, the gene blocks to be shuffled are desirably designed
so that they are always translated to give the corresponding
amino acid sequences, irrespective of their relative positions
in the shuffled sequence. For this, the double-stranded parts
and the cohesive ends shall be selected in codon units
in accordance with the reading frame of the gene DNA to be
shuffled.
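
The two design guidelines above (block length of from 10 to 35 % of the gene, and division in codon units) can be checked with a small sketch. The function check_block_design and the cut points used below are hypothetical and chosen only for illustration; they are not the division points used in the Examples. Only the gene length of 1122 bases is taken from Sequence Number 1.

```python
def check_block_design(gene_length, cut_points, min_frac=0.10, max_frac=0.35):
    """Check each block against the reading-frame and block-length guidelines.

    cut_points are 0-based nucleotide offsets, counted from the first base
    of the start codon, at which the gene is divided (hypothetical input).
    """
    bounds = [0, *sorted(cut_points), gene_length]
    report = []
    for left, right in zip(bounds, bounds[1:]):
        length = right - left
        report.append({
            "start": left,
            "length": length,
            # a block keeps the frame if it starts and ends on a codon boundary
            "in_frame": left % 3 == 0 and length % 3 == 0,
            "preferred_size": min_frac <= length / gene_length <= max_frac,
        })
    return report


# Hypothetical division of a 1122-base gene into four blocks
for block in check_block_design(1122, [300, 561, 840]):
    print(block)
```

A cut point that is not a multiple of three, such as 301, would be reported as out of frame for both blocks adjoining it.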
Needless to say, it is unnecessary to conduct the division
into segment blocks with genetic meanings. Namely, it is
unnecessary to divide the gene DNA into the
constitutive exons or segment blocks that correspond to the
domains or modules of the protein which the gene DNA codes for.
It is probable that shuffling at such sites has already
been examined in the natural world in the past. In order
to obtain base sequences that have not heretofore been examined
in the natural world, it is desirable that the division of the
gene DNA is effected inside the constitutive exons or at the
sites corresponding to the inside of the domains or modules of
the protein which the gene DNA codes for.

Employing such means, therefore, it is possible to obtain
proteins which have different structures as a whole from those
of natural proteins but which partly contain amino acid
sequences that have been confirmed to be useful in the natural
world. Accordingly, the probability of obtaining useful proteins
by such means is higher than with the means of synthesizing
proteins totally at random.

The kind of the gene to be shuffled according to the
present invention is not specifically defined. Employable herein
is any gene that is composed of polynucleotide chains
and contains a coding region necessary for expressing a protein
or RNA. The nucleotide unit may contain any molecule of
deoxyribonucleotides or ribonucleotides. For the purpose of
finding useful base sequences, preferred are genes coding for
proteins, especially enzymes, or control genes for enzymatic
functions. Examples of such enzymes include proteases,
lipases, cellulases, amylases, catalases, xylanases, oxidases,
dehydrogenases, oxygenases and reductases.

The kind of the gene to which the present invention is
directed is not specifically defined but shall be such that,
when it is introduced into a suitable host, the host can
produce the genetic product through expression of the gene.
Examples include genes cloned from living
organisms, artificially synthesized genes, and genes
cloned from living organisms and then artificially mutated. For
the genes derived from living organisms, employable are
prokaryotes with definite enzyme producibility. Examples of such
prokaryotes include Bacillus bacteria. One example of the
genes derived from such bacteria is a protease API21 gene
derived from Bacillus NKS-21 (FERM BP-93-1) (Japanese Patent
Application Laid-open No. 5-91876, Sequence Number 1).

DNA Pool

The present invention also provides a DNA pool
obtained according to the above-mentioned shuffling method.
The "DNA pool" as referred to herein means a high-density
mixture of two or more DNAs. The DNA pool of the present
invention can contain a particular number or more, for example,
10 or more, different DNA molecules having different structures.
It is desirable that, when the mixture (DNA pool) is directly
used in a biochemical operation or reaction, it is in such a form
that all the plural nucleic acid components constituting it can
be reacted. However, the form of the mixture (DNA pool) is not
specifically defined, and the DNA pool may be either in
solution or a dry mixture.

To produce the DNA pool, for example, a plurality of
cohesive ends for each block are prepared in the
above-mentioned shuffling process. Referring to the embodiment of
Fig. 1, for example, when, for the cohesive end a1r of the block
a1, complementary strands to the other cohesive ends, aA and a2f,
are prepared in addition to the complementary strand to a3f,
then DNAs of A - a1 - a2 - B and A - a1 - a2 - a1 can be
obtained. If a complementary strand to the other cohesive end
a1f of a1 is added, it is also possible to produce other DNAs
comprising a series of the same blocks, such as A - a1 - a1 -
a1.

In the same manner, for the cohesive ends of a2 and a3, if
oligonucleotides that are complementary to the cohesive ends of
the other blocks or complementary to the other cohesive end of
themselves are added, other sequences comprising these can be
produced.

In general, a DNA is divided into blocks of a1, a2, a3,
..., an. Then, each block is processed to have a cohesive end
or cohesive ends according to the above-mentioned process. The
cohesive ends are designed to be oligonucleotides that are
complementary to the cohesive ends of the other blocks or are
complementary to the other cohesive end of themselves. All or
a part of the thus-obtained DNA blocks are mixed and ligated to
each other, thereby producing a nucleic acid pool containing
different nucleic acids composed of the blocks as differently
sequenced at random.
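
The composition of such a pool can be sketched by treating every pair of complementary cohesive ends as an allowed junction between two blocks and listing the products that run from block A to block B. The following is a simplified model under stated assumptions: the function enumerate_pool and the limit max_blocks are hypothetical, and a real ligation mixture is not limited to a fixed number of blocks in this way.

```python
def enumerate_pool(allowed_junctions, start="A", end="B", max_blocks=3):
    """List the products from `start` to `end` that the junctions permit.

    allowed_junctions is a set of (upstream_block, downstream_block) pairs
    whose facing cohesive ends were made complementary. max_blocks caps the
    number of intermediate blocks and is a modelling assumption only.
    """
    products = []

    def extend(path):
        last = path[-1]
        if last == end:
            products.append(" - ".join(path))
            return
        for up, down in sorted(allowed_junctions):
            if up != last:
                continue
            # once max_blocks intermediate blocks are present, only close
            if down != end and len(path) - 1 >= max_blocks:
                continue
            extend(path + [down])

    extend([start])
    return products


blocks = ["a1", "a2", "a3"]
# Every intermediate block may follow A, follow any block, and precede B
junctions = (
    {("A", b) for b in blocks}
    | {(x, y) for x in blocks for y in blocks}
    | {(b, "B") for b in blocks}
)
pool = enumerate_pool(junctions)
print(len(pool))

# A single designed order, as in the embodiment of Fig. 1
single = enumerate_pool({("A", "a3"), ("a3", "a1"), ("a1", "a2"), ("a2", "B")})
print(single)
```

Restricting the set of junctions narrows the pool; when only the four junctions of the Fig. 1 design are allowed, a single product remains.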
Expression of Genetic Information in Shuffled DNA or DNA Pool

The thus-shuffled, single or mixed, double-stranded DNAs
are blunted. The blunting may be omitted if DNA blocks with
one cohesive end are positioned at the ends of the shuffled,
double-stranded DNA. For example, the 5'-terminal (relative to
the direction of the promoter) of the sequence containing a DNA
block with a predetermined promoter sequence is made to have a
blunt end rather than a cohesive end, while the 3'-terminal
(relative to the direction of the terminator) of the sequence
containing a DNA block with a predetermined terminator sequence
is likewise made to have a blunt end. In that manner, it is
possible to directly obtain, without blunting, a gene in which
the blocks of the intended gene have been shuffled between the
promoter and the terminator. After this, the thus-shuffled
DNA is inserted into a desired vector, preferably an expression
vector such as pKK223-3, using a DNA ligase. The promoter
sequence and the terminator sequence in the shuffled DNA
are not limited to one each; a plurality of promoter
sequences and terminator sequences may be present.

If desired, the polynucleotide blocks positioned at
both ends of the shuffled DNA may be designed to have suitable
restriction enzyme recognition sites. In this case, the DNA
may be ligated to a suitable vector using the defined
restriction enzymes.

Next, the vector library produced in the manner
mentioned above is introduced into a suitable host, in which
the genetic information is expressed. Thus, the intended
genetic product with favorable properties and also the gene
coding for it can be obtained. Any ordinary host can be
used herein. Preferred examples of the host include cells of
E. coli, Bacillus bacteria, yeasts, and lactic acid bacteria.
If desired, in-vitro transcription systems and translation
systems are also employable herein. In those cases, the
genetic information can be expressed even when the gene is not
ligated to a vector.

The "genetic information" as referred to herein indicates
the information on a gene which is carried by a DNA and which
is translated into a protein or is transcribed into RNA in a
suitable living body by the DNA itself or after it has been
ligated to any other DNA or RNA.

The genetic information that is expected to be expressed
according to the method of the present invention is not
specifically defined, but includes, for example, that on
various genetic products, such as enzymes, antibodies, hormones,
receptor proteins and ribozymes, and that on various control
functions of, for example, operators, promoters and
attenuators.

Examples 

The present invention is now described in detail
with reference to the following examples, which,
however, are not intended to restrict the scope of the present
invention.

Example 1: Production of DNA Pool

A nucleic acid pool was produced in accordance with the
process mentioned below, based on the gene of the wild-type
alkali protease API21 (Japanese Patent Application Laid-open
No. 5-91876) as cloned from Bacillus NKS-21 (FERM BP-93-1),
having the sequence of Sequence Number 1.

(1) Step a): Preparation of Oligonucleotide Blocks for Primer
(1-1) Synthesis of Oligonucleotide Blocks:

Using an automatic DNA synthesizer, Model 392
(manufactured by Perkin Elmer Co.), the following 14
oligonucleotides were synthesized: oligo FW (Sequence Number 2),
oligo RV (Sequence Number 3), oligo 1r (Sequence Number 4),
oligo 1b (Sequence Number 5), oligo 1a (Sequence Number 6),
oligo 2r (Sequence Number 7), oligo 2b (Sequence Number 8),
oligo 2a (Sequence Number 9), oligo 3r (Sequence Number 10),
oligo 3b (Sequence Number 11), oligo 3a (Sequence Number 12),
oligo 4r (Sequence Number 13), oligo 4b (Sequence Number 14),
oligo 4a (Sequence Number 15) and oligo A (Sequence Number 16).
These are parts of the base sequence of API21 (Japanese Patent
Application Laid-open No. 5-91876) (including complementary
strands), or oligonucleotides containing a part of the base
sequence. However, the sequence of oligo 4a follows the
glutamine of Sequence Number 1 and contains a
termination codon of the gene. These oligonucleotides were
designed so as to be optimal when the oligo A is
overhung on the 3'-terminal of the amplified DNA in the
experiment described below, using a Taq polymerase.
These oligonucleotides were synthesized in a DM trityl-on
condition (that is, while the 5'-hydroxyl group was protected
with a dimethoxytrityl group), and purified through an OPC
column. The reagents used herein were obtained from Perkin
Elmer Co.

(1-2) Addition of Ribonucleotide to Blocks:

Next, 500 pmols of oligo 1r, 1 nmol of ATP and 10 units of
terminal deoxynucleotidyl transferase were added to a standard
solution comprising:

50 mM Tris-HCl buffer (pH 8.0)
10 mM MgCl2
5 mM DTT (dithiothreitol)
25 % PEG 6000
1 mM HCC (hexaammine cobalt chloride)
10 µg/ml BSA (bovine serum albumin),

to thereby make 10 µl in total. The resulting solution was
left at 37°C for 1 hour.

Oligo 2r, oligo 3r, oligo 4r, oligo 1b, oligo 2b, oligo 3b
and oligo 4b were processed in the same manner as above. The
eight polynucleotides thus formed are referred to as oligo 1r',
oligo 2r', oligo 3r', oligo 4r', oligo 1b', oligo 2b', oligo
3b' and oligo 4b'.
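
For reference, the amounts stated above can be converted into concentrations in the 10 µl reaction. The sketch below is plain arithmetic on the stated amounts; the helper name micromolar is hypothetical.

```python
def micromolar(amount_pmol, volume_ul):
    """Concentration in µM; pmol per µl is numerically equal to µmol per liter."""
    return amount_pmol / volume_ul


oligo_uM = micromolar(500, 10)   # 500 pmol of oligonucleotide in 10 µl
atp_uM = micromolar(1000, 10)    # 1 nmol (1000 pmol) of ATP in 10 µl

print(oligo_uM, atp_uM, atp_uM / oligo_uM)
```

Under these amounts the ATP is present at twice the molar concentration of the oligonucleotide.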

(1-3) Phosphorylation:

500 pmols of oligo 1a, 1 nmol of ATP and 10 units of
polynucleotide kinase were dissolved in the standard solution
having the same composition as above to make 10 µl in total.
The resulting solution was left at 37°C for 1 hour. Oligo 2a,
oligo 3a and oligo 4a were processed in the same manner as
above. The polynucleotides thus formed are referred to as
oligo 1a', oligo 2a', oligo 3a' and oligo 4a'.
(1-4) Ligation of Oligonucleotide Blocks: 

500 pmols of oligo 1a', 100 pmols of oligo 1b', 100 pmols
of oligo 2b', 100 pmols of oligo 3b' and 100 pmols of oligo 4b',
which had been obtained in the above, as well as 1 nmol of ATP
and 50 units of T4 RNA ligase were added to the same standard
solution as that mentioned above to make 10 µl in total, and
these were reacted at 25°C for 4 hours.

The other combinations, oligo 2a' with oligo 1b', oligo
2b', oligo 3b' and oligo 4b'; oligo 3a' with oligo 1b', oligo
2b', oligo 3b' and oligo 4b'; and oligo 4a' with oligo 1b',
oligo 2b', oligo 3b' and oligo 4b', were also reacted in the
same manner as above. A mixture of the four polynucleotides
thus formed as a result of this reaction, oligo 1a' ligated to
oligo 1b', oligo 2b', oligo 3b' and oligo 4b', is referred to
as oligo 1M; a mixture of the four polynucleotides, oligo 2a'
ligated to oligo 1b', oligo 2b', oligo 3b' and oligo 4b', is
referred to as oligo 2M; a mixture of the four polynucleotides,
oligo 3a' ligated to oligo 1b', oligo 2b', oligo 3b' and oligo
4b', is referred to as oligo 3M; and a mixture of the four
polynucleotides, oligo 4a' ligated to oligo 1b', oligo 2b',
oligo 3b' and oligo 4b', is referred to as oligo 4M.
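
The composition of the four primer mixtures can be listed with a short sketch. It is bookkeeping only: the variable names are hypothetical, and each linked oligonucleotide is written in the b'-a' notation used for the linked primer in step b).

```python
donors = ["1a'", "2a'", "3a'", "4a'"]      # 5'-phosphorylated oligonucleotides
acceptors = ["1b'", "2b'", "3b'", "4b'"]   # oligonucleotides with a 3'-ribonucleotide

# oligo 1M to oligo 4M: each is a mixture of one a' oligo linked to each b' oligo
mixtures = {
    f"oligo {donor[0]}M": [f"{acceptor}-{donor}" for acceptor in acceptors]
    for donor in donors
}

for name, members in mixtures.items():
    print(name, members)
```

Each mixture contains four linked oligonucleotides, giving sixteen linked primers in total across oligo 1M to oligo 4M.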
(2) Steps b) to d): Formation of Gene Blocks

A template, plasmid pSDT812 (Japanese Patent Application
Laid-open No. 1-141596), which had been prepared by inserting,
into the ClaI cleaving site of pHSG396, the gene of the
wild-type alkali protease as cloned from Bacillus NKS-21, was
subjected to PCR with primers, oligo 1M and oligo 2r'. The
gene fragment amplified through this reaction was treated
with a ribonuclease, and then heated at 80°C for 5 minutes,
whereby the polynucleotide(s) positioned at the 5'-terminal of
the ribonucleotide existing in both strands or one strand
was/were removed. As a result, a gene block with cohesive
end(s) was prepared. This gene block is referred to as
block 1M.

The other four combinations, oligo 2M and oligo 3r', oligo
3M and oligo 4r', oligo 4M and oligo RV, and oligo FW and oligo
1r', were processed in the same manner as above. The blocks
thus prepared are referred to as block 2M, block 3M, block B,
and block F, respectively.

Example 2: Shuffling

Equal amounts of block 1M, block 2M, block 3M, block B and
block F were blended and ligated together, using Pfu DNA
ligase.

After the ligation, the reaction mixture was subjected to
agarose gel electrophoresis, from which the DNA fragment of
about 1.5 kbp was collected.

Example 3: Identification of Nucleic Acid Pool

The thus-collected DNA of about 1.5 kbp was digested with
restriction enzymes, EcoRI and BamHI, then mixed with a
plasmid, pHY300PLK (manufactured by Yakult Honsha Co.), which
had been digested with restriction enzymes, EcoRI and BamHI, and
processed with an alkaline phosphatase, and these were ligated
together using a ligation kit (manufactured by Takara Shuzo
Co.). Using the resulting DNA, cells of E. coli JM105 were
transformed, from which tetracycline-resistant
transformants were selected. From these transformants, plasmid
DNAs were extracted, purified and analyzed according to ordinary
methods. Thus 97 clones were obtained with a DNA of 1.5 kbp
inserted between the EcoRI and BamHI recognition sites of
pHY300PLK.

The base sequences of the DNAs thus obtained were determined
to analyze the order in which block 1M, block 2M, block 3M,
block F and block B were ligated, that is, how these were
shuffled. As expected from the principle, block F was positioned
at the first site and block B at the fifth site, and block 1M,
block 2M and block 3M were shuffled between the two. Table 1
shows the different types of shuffling, and the number of clones
of each type.
Table 1

Type of      Number of     Type of      Number of
Shuffling    Clones        Shuffling    Clones

111          2             223          2
112          5             231          5
113          2             232          2
121          3             233          3
122          4             311          2
123          7             312          6
131          4             313          5
132          5             321          7
133          3             322          2
211          1             323          5
212          5             331          2
213          4             332          5
221          1             333          2
222          3


As shown above, it has been confirmed that, if three
blocks of one gene are shuffled according to the method of the
present invention, a nucleic acid pool is obtained that covers
all combinations of clones each containing the same or
different three of these blocks.
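
The coverage stated above can be checked against the figures of Table 1 with a short sketch. The counts are copied from Table 1; the variable names are hypothetical.

```python
from itertools import product

# Type of shuffling -> number of clones, as listed in Table 1
table1 = {
    "111": 2, "112": 5, "113": 2, "121": 3, "122": 4, "123": 7,
    "131": 4, "132": 5, "133": 3, "211": 1, "212": 5, "213": 4,
    "221": 1, "222": 3, "223": 2, "231": 5, "232": 2, "233": 3,
    "311": 2, "312": 6, "313": 5, "321": 7, "322": 2, "323": 5,
    "331": 2, "332": 5, "333": 2,
}

# Every possible order of three positions filled from blocks 1M, 2M and 3M
all_types = {"".join(p) for p in product("123", repeat=3)}

missing = all_types - table1.keys()
total_clones = sum(table1.values())

print(len(table1), len(missing), total_clones)
```

All 27 possible types appear in the table, and the clone counts add up to the 97 clones obtained in Example 3.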

Example 4: Screening of Genetic Products Obtained from Nucleic
Acid Pool

The DNAs produced in Example 3 were mixed. Using the
resulting DNA mixture, cells of Bacillus subtilis UOT0999 were
transformed. Tetracycline-resistant transformants were
selected. 300 transformants were replicated on a skim
milk-containing medium plate, on which clear zones were found
around the colonies of 12 transformants. Accordingly, it is
understood that the enzyme which the shuffled gene codes for can
be selected depending on its activity. The base sequences of
these 12 clones that formed the clear zones were analyzed, from
which it was found that these were sequenced in the same order
of blocks as in the wild-type enzyme.

Example 5: Detection of Genetic Products

From 10 clones (one clone forms a halo, while 9 clones do
not) selected from the transformants that had been obtained in
Example 3, and also from the host, Bacillus subtilis UOT0999,
full-length RNAs were prepared. These were processed with a
ribonuclease-free deoxyribonuclease, in order to remove the
influence of the plasmids on the hybridization to be effected
later on. Next, using oligo 1r as the probe, these were
subjected to Northern hybridization. As a result, all lanes
corresponding to the RNA of the transformants gave detectable
bands, but no band was detected on the lanes corresponding to
the RNA of the host.

Advantages of the Invention

According to the present invention, a double-stranded DNA
molecule with any desired cohesive end or ends is provided.
Using this, it is possible to obtain, through simple processes,
various DNAs with various base sequences which are substantially
apart from the naturally-existing base sequence spaces, and also
a DNA pool of a mixture of such DNAs. Therefore,
it is possible to obtain excellent genetic products, such as
proteins and enzymes, which could not be obtained by
conventional methods and which were not examined by organisms
in the past. In addition, according to the method of the
present invention for producing a nucleic acid pool, it is
possible to obtain a mixture of nucleic acids while optionally
shuffling the constitutive blocks at random in the intermediate
parts but fixing the terminal sequences to be predetermined,
desired ones, and it is also possible to shuffle the
constitutive blocks without changing the amino acid sequence
which each block codes for. Therefore, as compared with a
method of producing a completely randomized nucleic acid pool,
there is a high possibility that useful genetic products can be
produced according to the method of the present invention.
Sequence Listing

Sequence Number: 1
Length of Sequence: 1122
Type of Sequence: Nucleic Acid
Number of Strands: Double-stranded
Topology: Linear
Kind of Sequence: Genomic DNA
Source: Bacillus NKS-21 (FERM BP-93-1)
Characteristics of Sequence:
Code Indicating Characteristics: Sig Peptide
Existing Site: 1 ... 93
Method of Determining Characteristics: S
Code Indicating Characteristics: Mat Peptide
Existing Site: 104 ... 1112
Method of Determining Characteristics: S
Sequence:



ATG AAT CTT CAA AAA ATA GCC TCA GCG TTG AAG GTT AAG CAA TCG GCA  48
Met Asn Leu Gln Lys Ile Ala Ser Ala Leu Lys Val Lys Gln Ser Ala
        -100                -95                 -90

TTG GTC AGC AGT TTA ACT ATT TTG TTT CTA ATC ATG CTA GTA GGT ACG  96
Leu Val Ser Ser Leu Thr Ile Leu Phe Leu Ile Met Leu Val Gly Thr
    -85                 -80                 -75

ACT AGT GCA AAT GGT GCG AAG CAA GAG TAC TTA ATT GGT TTC AAC TCA  144
Thr Ser Ala Asn Gly Ala Lys Gln Glu Tyr Leu Ile Gly Phe Asn Ser
-70                 -65                 -60                 -55

GAC AAG GCA AAA GGA CTT ATC CAA AAT GCA GGT GGA GAA ATT CAT CAT  192
Asp Lys Ala Lys Gly Leu Ile Gln Asn Ala Gly Gly Glu Ile His His
                -50                 -45                 -40

GAA TAT ACA GAG TTT CCA GTT ATC TAT GCA GAG CTT CCA GAA GCA GCG  240
Glu Tyr Thr Glu Phe Pro Val Ile Tyr Ala Glu Leu Pro Glu Ala Ala
            -35                 -30                 -25

GTA AGT GGA TTG AAA AAT AAT CCT CAT ATT GAT TTT ATT GAG GAA AAC  288
Val Ser Gly Leu Lys Asn Asn Pro His Ile Asp Phe Ile Glu Glu Asn
        -20                 -15                 -10

GAA GAA GTT GAA ATT GCA CAG ACT GTT CCT TGG GGA ATC CCT TAT ATT  336
Glu Glu Val Glu Ile Ala Gln Thr Val Pro Trp Gly Ile Pro Tyr Ile
    -5                  1               5                   10

TAC TCG GAT GTT GTT CAT CGT CAA GGT TAC TTT GGG AAC GGA GTA AAA  384
Tyr Ser Asp Val Val His Arg Gln Gly Tyr Phe Gly Asn Gly Val Lys
                15                  20                  25

GTA GCA GTA CTT GAT ACA GGA GTG GCT CCT CAT CCT GAT TTA CAT ATT  432
Val Ala Val Leu Asp Thr Gly Val Ala Pro His Pro Asp Leu His Ile
            30                  35                  40

AGA GGA GGA GTA AGC TTT ATC TCT ACA GAA AAC ACT TAT GTG GAT TAT  480
Arg Gly Gly Val Ser Phe Ile Ser Thr Glu Asn Thr Tyr Val Asp Tyr
        45                  50                  55

AAT GGT CAC GGT ACT CAC GTA GCT GGT ACT GTA GCT GCC CTA AAC AAT  528
Asn Gly His Gly Thr His Val Ala Gly Thr Val Ala Ala Leu Asn Asn
    60                  65                  70

TCA TAT GGC GTA TTG GGA GTG GCT CCT GGA GCT GAA CTA TAT GCT GTT  576
Ser Tyr Gly Val Leu Gly Val Ala Pro Gly Ala Glu Leu Tyr Ala Val
75                  80                  85                  90

AAA GTT CTT GAT CGT AAC GGA AGC GGT TCG CAT GCA TCC ATT GCT CAA  624
Lys Val Leu Asp Arg Asn Gly Ser Gly Ser His Ala Ser Ile Ala Gln
                95                  100                 105

GGA ATT GAA TGG GCG ATG AAT AAT GGG ATG GAT ATT GCC AAC ATG AGT  672
Gly Ile Glu Trp Ala Met Asn Asn Gly Met Asp Ile Ala Asn Met Ser
            110                 115                 120

TTA GGA AGT CCT TCT GGG TCT ACA ACC CTG CAA TTA GCA GCA GAC CGC  720
Leu Gly Ser Pro Ser Gly Ser Thr Thr Leu Gln Leu Ala Ala Asp Arg
        125                 130                 135

GCT AGG AAT GCA GGT GTC TTA TTA ATT GGG GCG GCT GGA AAC TCA GGA  768
Ala Arg Asn Ala Gly Val Leu Leu Ile Gly Ala Ala Gly Asn Ser Gly
    140                 145                 150

CAA CAA GGC GGC TCG AAT AAC ATG GGC TAC CCA GCG CGC TAT GCA TCT  816
Gln Gln Gly Gly Ser Asn Asn Met Gly Tyr Pro Ala Arg Tyr Ala Ser
155                 160                 165                 170

GTC ATG GCT GTT GGA GCG GTG GAC CAA AAT GGA AAT AGA GCG AAC TTT  864
Val Met Ala Val Gly Ala Val Asp Gln Asn Gly Asn Arg Ala Asn Phe
                175                 180                 185

TCA AGC TAT GGA TCA GAA CTT GAG ATT ATG GCG CCT GGT GTC AAT ATT  912
Ser Ser Tyr Gly Ser Glu Leu Glu Ile Met Ala Pro Gly Val Asn Ile
            190                 195                 200

AAC AGT ACG TAT TTA AAT AAC GGA TAT CGC AGT TTA AAT GGT ACG TCA  960
Asn Ser Thr Tyr Leu Asn Asn Gly Tyr Arg Ser Leu Asn Gly Thr Ser
        205                 210                 215

ATG GCA TCT CCA CAT GTT GCT GGG GTA GCT GCA TTA GTT AAA CAA AAA  1008
Met Ala Ser Pro His Val Ala Gly Val Ala Ala Leu Val Lys Gln Lys
    220                 225                 230

CAC CCT CAC TTA ACG GCG GCA CAA ATT CGT AAT CGT ATG AAT CAA ACA  1056
His Pro His Leu Thr Ala Ala Gln Ile Arg Asn Arg Met Asn Gln Thr
235                 240                 245                 250

GCA ATT CCG CTT GGT AAC AGC ACG TAT TAT GGA AAT GGC TTA GTG GAT  1104
Ala Ile Pro Leu Gly Asn Ser Thr Tyr Tyr Gly Asn Gly Leu Val Asp
                255                 260                 265

GCT GAG TAT GCG GCT CAA  1122
Ala Glu Tyr Ala Ala Gln
            270     272

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Sequence Number: 2 

Length of Sequence: 20 
Type of Sequence: Nucleic Acid 
Number of Strand: Single-stranded 
5 Topology: Linear 

Kind of Sequence: Other Nucleic Acid, Synthetic DNA 
Sequence: 

GATTTTAGAA TTCGCAGCGG 

10 Sequence Number: 3 

Length of Sequence: 25 

Type of Sequence: Nucleic Acid 

Number of Strand: Single-stranded 

Topology: Linear 
15 Kind of Sequence: Other Nucleic Acid, Synthetic DNA 

Sequence: 

CCGGATTCCT TAAAGCCCTG AATAA 

Sequence Number: 4 
20 Length of Sequence: 17 

Type of Sequence: Nucleic Acid 
Number of Strand: Single-stranded 
Topology: Linear 

Kind of Sequence: Other Nucleic Acid, Synthetic DNA 
2 5 Sequence: 

ACAGTCTGTG CAATTTC 

Sequence Number : 5 
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Length of Sequence: 17 
Type of Sequence: Nucleic Acid 
Number of Strand: Single-stranded 
Topology: Linear 

Kind of Sequence: Other Nucleic Acid, Synthetic DNA 
Sequence: 

GAAATTGCAC AGACTGT 

Sequence Number: 6 

Length of Sequence: 20 
Type of Sequence: Nucleic Acid 
Number of Strand: Single-stranded 
Topology: Linear 

Kind of Sequence: Other Nucleic Acid, Synthetic DNA 
Sequence:

CCTTGGGGAA TCCCTTATAT 

Sequence Number: 7 

Length of Sequence: 17 
Type of Sequence: Nucleic Acid 
Number of Strand: Single-stranded 
Topology: Linear 

Kind of Sequence: Other Nucleic Acid, Synthetic DNA 
Sequence: 

CCCAATACGC CATATGA

Sequence Number: 8

Length of Sequence: 17 
Type of Sequence: Nucleic Acid 
Number of Strand: Single-stranded 
Topology: Linear 

Kind of Sequence: Other Nucleic Acid, Synthetic DNA 
Sequence: 

TCATATGGCG TATTGGG 



Sequence Number: 9 

Length of Sequence: 20 
Type of Sequence: Nucleic Acid 
Number of Strand: Single-stranded 
Topology: Linear

Kind of Sequence: Other Nucleic Acid, Synthetic DNA 
Sequence: 

GTGGCTCCTG GAGCTGAACT 



Sequence Number: 10

Length of Sequence: 16
Type of Sequence: Nucleic Acid
Number of Strand: Single-stranded
Topology: Linear

Kind of Sequence: Other Nucleic Acid, Synthetic DNA

Sequence: 

TCTGATCCAT AGCTTG

Sequence Number: 11

Length of Sequence: 16
Type of Sequence: Nucleic Acid
Number of Strand: Single-stranded
Topology: Linear

Kind of Sequence: Other Nucleic Acid, Synthetic DNA
Sequence:

CAAGCTATGG ATCAGA 

Sequence Number: 12

Length of Sequence: 20
Type of Sequence: Nucleic Acid
Number of Strand: Single-stranded
Topology: Linear

Kind of Sequence: Other Nucleic Acid, Synthetic DNA
Sequence:

CTTGAGATTA TGGCGCCTGG 

Sequence Number: 13

Length of Sequence: 17
Type of Sequence: Nucleic Acid
Number of Strand: Single-stranded
Topology: Linear

Kind of Sequence: Other Nucleic Acid, Synthetic DNA
Sequence:

TGAGCCGCAT ACTCAGC 

Sequence Number: 14

Length of Sequence: 17
Type of Sequence: Nucleic Acid 
Number of Strand: Single-stranded 
Topology: Linear 

Kind of Sequence: Other Nucleic Acid, Synthetic DNA 
Sequence: 

GCTGAGTATG CGGCTCA 

Sequence Number: 15 

Length of Sequence: 20 
Type of Sequence: Nucleic Acid 
Number of Strand: Single-stranded 
Topology: Linear 

Kind of Sequence: Other Nucleic Acid, Synthetic DNA 
Sequence: 

TAATCCCTAA GGATGTACTG 
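Several of the synthetic oligonucleotides listed above appear to come in mutually complementary pairs. The sketch below is only a consistency check computed from the printed sequences; the listing itself does not state how the primers are paired, and the helper names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (spaces ignored)."""
    return seq.replace(" ", "").upper().translate(COMPLEMENT)[::-1]


# Sequences copied from Sequence Numbers 4, 5, 7, 8, 10, 11, 13 and 14.
PRIMERS = {
    4: "ACAGTCTGTG CAATTTC",
    5: "GAAATTGCAC AGACTGT",
    7: "CCCAATACGC CATATGA",
    8: "TCATATGGCG TATTGGG",
    10: "TCTGATCCAT AGCTTG",
    11: "CAAGCTATGG ATCAGA",
    13: "TGAGCCGCAT ACTCAGC",
    14: "GCTGAGTATG CGGCTCA",
}


def complementary_pairs(primers: dict) -> list:
    """List (a, b) with a < b where primer b is the reverse complement of a."""
    clean = {n: s.replace(" ", "") for n, s in primers.items()}
    return [
        (a, b)
        for a in sorted(clean)
        for b in sorted(clean)
        if a < b and reverse_complement(clean[a]) == clean[b]
    ]


print(complementary_pairs(PRIMERS))  # [(4, 5), (7, 8), (10, 11), (13, 14)]
```

The same helper can be used to confirm the stated "Length of Sequence" values, for example `len("GATTTTAGAA TTCGCAGCGG".replace(" ", ""))` for Sequence Number 2.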



Brief Description of the Drawing 

Fig. 1 is a graphical view showing one embodiment of the method of 
the present invention for shuffling a DNA.

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description
on page 24, lines 12-15



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet 



Name of depositary institution 

National Institute of Bioscience and Human-Technology, Agency of Industrial Science and 
Technology, Ministry of International Trade and Industry 

Address of depositary institution (including postal code and country) 



1-3 Higashi 1-chome, Tsukuba-shi, Ibaraki-ken, Japan 



Date of deposit 



7 May 1985 



Accession Number 



FERM BP-93-1 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional 



Until the publication of the mention of grant of a European patent or, where applicable, for twenty years from the 
date of filing if the application has been refused, withdrawn or deemed withdrawn, a sample of the deposited micro- 
organism is only to be provided to an independent expert nominated by the person requesting the sample (cf. Rule 
28(4) EPC). And as far as Australia is concerned, the expert option is likewise requested, reference being had to 
Regulation 3.25 of Australia Statutory Rules 1991 No 71. Also, for Canada we request that only an independent 
expert nominated by the Commissioner is authorized to have access to a sample of the microorganism deposited. 

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indica- 
tions e.g., "Accession Number of Deposit") 



For receiving Office use only 



This sheet was received with the international 
application 



Authorized officer 

Head Clerk 




For International Bureau use only 



This sheet was received by the International Bureau



Authorized officer 



Form PCT/RO/134 (July 1992)

CLAIMS

1. A DNA with a cohesive end comprising (a) a double-stranded
DNA having the same sequence as that of a part of a gene, and
(b) a single-stranded DNA having a base sequence that exists on
said gene at the site not adjoining the part corresponding to
said double-stranded DNA or a base sequence which said gene does
not have, wherein the single-stranded DNA is linked to either
one end of the double-stranded DNA to form a cohesive end.

2. A DNA with cohesive ends comprising (a) a double-stranded
DNA having the same sequence as that of a part of a gene, (b) a
first, single-stranded DNA having a base sequence that exists on
said gene at the site not adjoining the part corresponding to
said double-stranded DNA or a base sequence which said gene does
not have, and (c) a second, single-stranded DNA having a base
sequence that exists on said gene at the site adjoining the part
corresponding to said double-stranded DNA, wherein the second,
single-stranded DNA is linked to said double-stranded DNA at one
end corresponding to said adjoining site, while the first,
single-stranded DNA is linked thereto at the other end of the
complementary strand opposite to said end, thereby forming
cohesive ends.

3. The DNA with a cohesive end or cohesive ends as
claimed in claim 1 or 2, wherein the single-stranded DNA has a
length of 2 bases or more.
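The "not adjoining" requirement of claims 1 and 2 and the length limit of claim 3 can be illustrated with a small check. This is a simplified sketch under stated assumptions: sequences are written on the top strand only, the double-stranded part is given as a slice `gene[start:end]`, and the function name `overhang_is_non_adjoining` is hypothetical. It does not handle the edge case of a sequence that occurs both next to the block and elsewhere on the gene.

```python
def overhang_is_non_adjoining(gene: str, start: int, end: int,
                              overhang: str, side: str = "right") -> bool:
    """Return True if the overhang is at least 2 bases long (claim 3) and
    differs from the gene sequence directly adjoining gene[start:end]."""
    k = len(overhang)
    if k < 2:
        return False
    if side == "right":
        adjoining = gene[end:end + k]            # natural continuation
    else:
        adjoining = gene[max(0, start - k):start]  # natural preceding bases
    return overhang != adjoining


gene = "AAACCCGGGTTT"  # toy gene; the block is gene[3:6] == "CCC"
print(overhang_is_non_adjoining(gene, 3, 6, "GGG"))  # False: adjoining site
print(overhang_is_non_adjoining(gene, 3, 6, "TTT"))  # True: distant site
```

An overhang that passes this check corresponds to the single-stranded DNA (b) of claim 1, while an overhang equal to the adjoining sequence corresponds to the second single-stranded DNA (c) of claim 2.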

4. The DNA with a cohesive end or cohesive ends as
claimed in any one of claims 1 to 3, wherein the cohesive
end/ends is/are positioned at the 3'-terminal/terminals.

5. A method for producing a DNA with a cohesive end or
cohesive ends, wherein a part of a DNA, as a template, and an
oligonucleotide containing at least one ribonucleotide, as a
primer, are subjected to DNA polymerase reaction to prepare a
double-stranded DNA, then the ribonucleotide(s) is/are removed
through enzymatic reaction or chemical reaction, and the
nucleotide(s) remaining at the 5'-terminal(s) of the site(s) at
which said ribonucleotide(s) existed are removed.

6. A method for producing the DNA with a cohesive end as
set forth in claim 1, comprising the following steps a) to d):

a) a step of linking (i) an oligonucleotide having the
same base sequence as that of a part of a gene DNA to (ii) an
oligonucleotide having a base sequence that exists on the gene
at the site not adjoining the base sequence of (i) or a base
sequence which the gene does not have, and containing at least
one ribonucleotide, in such a manner that the oligonucleotide
(ii) is positioned at the 5'-terminal of the oligonucleotide
(i);

b) a step of preparing a double-stranded DNA through DNA
polymerase reaction between a DNA containing the part
corresponding to the oligonucleotide (i) in said a), as a
template, and the linked oligonucleotide as obtained in the
previous step a), as a primer;

c) a step of removing the ribonucleotide from said
double-stranded DNA through enzymatic reaction or chemical
reaction; and

d) a step of removing the nucleotide remaining at the
5'-terminal of the site at which said ribonucleotide existed.

7. A method for producing the DNA with cohesive ends as
set forth in claim 2, comprising the following steps a) to d):

a) a step of linking (i) an oligonucleotide having the
same base sequence as that of a part of a gene DNA to (ii) an
oligonucleotide having a base sequence that exists on the gene
at the site not adjoining the base sequence of (i) or a base
sequence which the gene does not have, and containing at least
one ribonucleotide, in such a manner that the oligonucleotide
(ii) is positioned at the 5'-terminal of the oligonucleotide
(i);

b) a step of preparing a double-stranded DNA through DNA
polymerase reaction between a DNA containing the part
corresponding to the oligonucleotide (i) in said a), as a
template, and (i) the linked oligonucleotide as obtained in the
previous step a) and (ii) an oligonucleotide which is a
complementary strand of an oligonucleotide existing on the gene
at the site separated from said oligonucleotide-corresponding
part by at least 3 bases or more toward the 3'-terminal and
which contains at least one ribonucleotide, as primers;

c) a step of removing the ribonucleotides from said
double-stranded DNA through enzymatic reaction or chemical
reaction; and

d) a step of removing the nucleotides remaining at the
5'-terminals of the sites at which said ribonucleotides existed.
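Steps a) to d) of claim 6 can be followed on strings. The sketch below is an illustration under simplifying assumptions, not the applicants' protocol: a lowercase letter marks a ribonucleotide, only the primer side of the product is modelled, and cleavage is assumed to occur immediately 3' of the last ribonucleotide. The function name `make_cohesive_block` and the toy sequences are hypothetical.

```python
from typing import Tuple

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement, ignoring the ribonucleotide marking."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def make_cohesive_block(template: str, part: str, tail: str) -> Tuple[str, str]:
    """Model steps a) to d): return (top strand of duplex, 3' overhang).

    part: oligonucleotide (i), identical to a stretch of the template.
    tail: oligonucleotide (ii); a lowercase letter marks a ribonucleotide.
    """
    start = template.find(part)
    if start < 0:
        raise ValueError("oligonucleotide (i) must occur in the template")
    ribo = [i for i, ch in enumerate(tail) if ch.islower()]
    if not ribo:
        raise ValueError("oligonucleotide (ii) must contain a ribonucleotide")
    cut = ribo[-1] + 1
    # a) link (ii) to the 5' end of (i); b) extend along the template.
    product_top = tail + template[start:]
    # c) cleave at the ribonucleotide; d) remove what remains 5' of the cut.
    duplex_top = product_top[cut:]
    # The complementary strand keeps the bases copied from the removed
    # stretch, so they are now exposed as a single-stranded 3' overhang.
    overhang = reverse_complement(product_top[:cut])
    return duplex_top, overhang


print(make_cohesive_block("ATGGCGCCTGGAGCTGAACT", "CCTGGAGC", "GGAt"))
# ('CCTGGAGCTGAACT', 'ATCC')
```

For the two-primer variant of claim 7, the same operation would be applied at the other end of the product with the second ribonucleotide-containing primer.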

8. A method for shuffling a DNA, comprising dividing a
DNA into a plurality of DNA blocks each having a cohesive end
or cohesive ends, followed by ligating them together into a
sequence that is different from the sequence of the original,
non-divided DNA.

9. A method for shuffling a DNA, comprising applying the
method as set forth in any one of claims 5 to 7 to various
sites of a DNA, thereby dividing the DNA into a plurality of
DNA blocks each having a cohesive end or cohesive ends, at
least one block of which shall have a cohesive end that is
complementary to the cohesive end of another block not having
been directly adjacent to said one block on the original DNA,
followed by ligating them together into a sequence that is
different from the sequence of the original, non-divided DNA.

10. The shuffling method as claimed in claim 8 or 9, 
wherein the DNA is divided into 3 or more blocks. 

11. The shuffling method as claimed in any one of claims
8 to 10, wherein the blocks are ligated together using a DNA
ligase.
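The shuffling of claims 8 to 11 can be pictured as blocks whose cohesive ends dictate a new joining order. The following is a minimal sketch, assuming each block carries at most one overhang per end and that two ends join only when one overhang is the reverse complement of the other; the `Block` class, the overhang sequences and the function names are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    return seq.upper().translate(COMPLEMENT)[::-1]


@dataclass(frozen=True)
class Block:
    name: str
    left: Optional[str]   # overhang at the left end, 5'->3', None if blunt
    right: Optional[str]  # overhang at the right end, 5'->3', None if blunt


def anneals(a: Optional[str], b: Optional[str]) -> bool:
    """Two cohesive ends anneal when they are reverse complements."""
    return a is not None and b is not None and reverse_complement(a) == b


def ligation_order(blocks: List[Block]) -> List[str]:
    """Follow complementary cohesive ends from the block with a blunt left end."""
    starts = [b for b in blocks if b.left is None]
    if len(starts) != 1:
        raise ValueError("expected exactly one block with a blunt left end")
    order = [starts[0]]
    remaining = [b for b in blocks if b is not starts[0]]
    while order[-1].right is not None:
        partners = [b for b in remaining if anneals(order[-1].right, b.left)]
        if len(partners) != 1:
            raise ValueError("missing or ambiguous ligation partner")
        order.append(partners[0])
        remaining.remove(partners[0])
    return [b.name for b in order]


# Blocks listed in their original order A, B, C, D on the gene; the
# overhangs are designed so that A joins C, C joins B, and B joins D.
blocks = [
    Block("A", None, "AACC"),
    Block("B", "TCGT", "CATC"),
    Block("C", "GGTT", "ACGA"),
    Block("D", "GATG", None),
]
print(ligation_order(blocks))  # ['A', 'C', 'B', 'D']
```

With three or more blocks, as in claim 10, at least one internal rearrangement of this kind becomes possible while the two terminal blocks stay in place.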

12. A DNA as shuffled according to the method as set 
forth in any one of claims 8 to 11. 

13. The DNA as claimed in claim 12, wherein a gene coding
for an enzymatic function or a control gene for the gene is
shuffled.

14. The DNA as claimed in claim 13, wherein the gene is a
gene that codes for any one of proteases, lipases, cellulases,
amylases, catalases, xylanases, oxidases, dehydrogenases,
oxygenases and reductases.

15. The DNA as claimed in claim 13 or 14, wherein the
gene is one derived from prokaryotes.

16. The DNA as claimed in claim 15, wherein the gene is
one derived from bacillus bacteria.

17. The DNA as claimed in claim 16, wherein the gene is a
protease API21 gene.

18. A DNA pool containing plural kinds of DNAs having
different structures that are obtained according to the
shuffling method as set forth in any one of claims 8 to 11.

19. The DNA pool as claimed in claim 18, which contains 
10 or more kinds of DNAs. 

20. A method for producing a DNA pool, comprising
applying the method as set forth in any one of claims 5 to 7 to
various sites of a template DNA to thereby prepare a mixture of
DNA blocks each having a cohesive end or cohesive ends that
satisfies the following conditions, followed by ligating these
into any desired sequences:

Condition 1: Each block has a double-stranded site having
the same sequence as that of a part of the template DNA.

Condition 2: At least two of the blocks that constitute
the block mixture further have, in addition to said
double-stranded site, a single-stranded site (cohesive end) that
is complementary to the cohesive end of blocks that are not
directly adjacent to said blocks on the template DNA.

Condition 3: The block mixture contains at least two
different blocks which are the same in the double-stranded site
but are different only in the single-stranded site and which
satisfy the condition 2.

21. The method for producing a DNA pool as claimed in
claim 20, wherein the template DNA is a gene that codes for an
enzymatic function or a control gene DNA for the gene.

22. The method for producing a DNA pool as claimed in
claim 21, wherein the template DNA is a gene DNA that codes for
any one of proteases, lipases, cellulases, amylases, catalases,
xylanases, oxidases, dehydrogenases, oxygenases and reductases.

23. The method for producing a DNA pool as claimed in
claim 22, wherein the template DNA is one derived from
prokaryotes.

24. The method for producing a DNA pool as claimed in
claim 23, wherein the template DNA is one derived from bacillus
bacteria.

25. The method for producing a DNA pool as claimed in
claim 24, wherein the template DNA is a protease API21 gene.

26. The method for producing a DNA pool as claimed in any
one of claims 20 to 25, wherein the DNA blocks are ligated
together using a DNA ligase.

27. A genetic product to be obtained by expressing the
genetic information on DNA molecules that exist in the DNA pool
as set forth in any one of claims 18 to 26.
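Conditions 1 to 3 of claim 20 can be expressed as checks on a block mixture. The sketch below is one possible reading under stated assumptions: each block is reduced to a position index on the template, its double-stranded sequence and a single cohesive end, and "not directly adjacent" is taken to mean an index difference greater than 1. The `PoolBlock` class and the function names are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    return seq.upper().translate(COMPLEMENT)[::-1]


@dataclass(frozen=True)
class PoolBlock:
    index: int    # order of the double-stranded site on the template
    duplex: str   # double-stranded site, top strand
    sticky: str   # cohesive end, 5'->3' ("" if the block has none)


def meets_condition_2(block: PoolBlock, mixture: List[PoolBlock]) -> bool:
    """The cohesive end is complementary to that of a non-adjacent block."""
    if not block.sticky:
        return False
    target = reverse_complement(block.sticky)
    return any(
        other.sticky == target and abs(other.index - block.index) > 1
        for other in mixture
    )


def check_pool_conditions(template: str,
                          mixture: List[PoolBlock]) -> Tuple[bool, bool, bool]:
    """Return (condition 1, condition 2, condition 3) for the mixture."""
    cond1 = all(b.duplex in template for b in mixture)
    qualifying = [b for b in mixture if meets_condition_2(b, mixture)]
    cond2 = len(qualifying) >= 2
    cond3 = any(
        a.duplex == b.duplex and a.sticky != b.sticky
        for i, a in enumerate(qualifying)
        for b in qualifying[i + 1:]
    )
    return cond1, cond2, cond3


template = "AAAACCCCGGGGTTTT"  # toy template with four 4-base sites
mixture = [
    PoolBlock(0, "AAAA", "AACC"),  # pairs with the block at index 2
    PoolBlock(0, "AAAA", "ACGA"),  # same duplex, pairs with index 3
    PoolBlock(1, "CCCC", ""),
    PoolBlock(2, "GGGG", "GGTT"),
    PoolBlock(3, "TTTT", "TCGT"),
]
print(check_pool_conditions(template, mixture))  # (True, True, True)
```

Condition 3 is what makes the result a pool rather than a single rearranged product: two variants of the same double-stranded site compete for different partners, so ligation yields more than one arrangement.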



INTERNATIONAL SEARCH REPORT 



International application No 

PCT/DK 97/00317 



A. CLASSIFICATION OF SUBJECT MATTER 



IPC6: C12N 15/10, C12Q 1/68 // C12N 9/00 

According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC6: C12N, C12Q 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 

SE, DK, FI, NO classes as above



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 



WPI, MEDLINE, DBA, BIOSIS, SCISEARCH 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



Chemistry Letters, Volume 2, 1995,
Koichi Nishigaki et al.,

"Restriction-Enzyme-Nondependent Recombination and
Rearrangement of DNA (RRR)", page 131



1-27 



WO 9107506 A1 (UNITED STATES OF AMERICA), 30 May
1991 (30.05.91), fig. 7 and the whole document



1-4 



WO 9517413 A1 (EVOTEC BIOSYSTEMS GMBH),

29 June 1995 (29.06.95), the whole document, see 
especially page 8, line 3-7, page 9, line 15-16, 
page 16, line 7-11 and claims 



8-19,27 



[X] Further documents are listed in the continuation of Box C.   [X] See patent family annex.



* Special categories of cited documents:

"A" document defining the general state of the art which is not considered
to be of particular relevance

"E" earlier document but published on or after the international filing date

"L" document which may throw doubts on priority claim(s) or which is
cited to establish the publication date of another citation or other
special reason (as specified)

"O" document referring to an oral disclosure, use, exhibition or other
means

"P" document published prior to the international filing date but later than
the priority date claimed

"T" later document published after the international filing date or priority
date and not in conflict with the application but cited to understand
the principle or theory underlying the invention

"X" document of particular relevance: the claimed invention cannot be
considered novel or cannot be considered to involve an inventive
step when the document is taken alone

"Y" document of particular relevance: the claimed invention cannot be
considered to involve an inventive step when the document is
combined with one or more other such documents, such combination
being obvious to a person skilled in the art

"&" document member of the same patent family



Date of the actual completion of the international search 



21 November 1997 



Date of mailing of the international search report 

01-12-1997



Name and mailing address of the ISA/ 

Swedish Patent Office 

Box 5055, S-102 42 STOCKHOLM 

Facsimile No. +46 8 666 02 86 



Authorized officer 

Patrick Andersson

Telephone No. +46 8 782 25 00



Form PCT/ISA/210 (second sheet) (July 1992)






Category*


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


X 


WO 9522625 A1 (AFFYMAX TECHNOLOGIES N.V.),
24 August 1995 (24.08.95), the whole document, see 
especially page 44, line 34 - page 45, line 15 


8-19,27 






Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This international search report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons:

1. [ ] Claims Nos.:
       because they relate to subject matter not required to be searched by this Authority, namely:

2. [ ] Claims Nos.:
       because they relate to parts of the international application that do not comply with the prescribed requirements to such
       an extent that no meaningful international search can be carried out, specifically:

3. [ ] Claims Nos.:
       because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
see next sheet 



1. [ ] As all required additional search fees were timely paid by the applicant, this international search report covers all
       searchable claims.

2. [X] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment
       of any additional fee.

3. [ ] As only some of the required additional search fees were timely paid by the applicant, this international search report
       covers only those claims for which fees were paid, specifically claims Nos.:

4. [ ] No required additional search fees were timely paid by the applicant. Consequently, this international search report is
       restricted to the invention first mentioned in the claims; it is covered by claims Nos.:



Remark on Protest 



The additional search fees were accompanied by the applicant's protest. 
No protest accompanied the payment of additional search fees. 



Form PCT/ISA/210 (continuation of first sheet (1)) (July 1992) 






According to PCT rule 13.2, an international application shall relate to one invention only or a 
group of inventions linked by one or more of the same or corresponding "special technical 
features", i.e. features that define a contribution which each of the inventions makes over the 
prior art. 

Such a unifying link would be a DNA sequence with a cohesive end comprising a double
stranded DNA sequence from a gene linked to at least one single stranded DNA not adjoining
the double stranded DNA sequence in the gene. However, such DNA sequences are known
in the art; see e.g. WO 9107506 or Nishigaki et al. in the search report. No other unifying special
technical feature has been found.

The application is considered to comprise the following inventions:

Invention 1, claims 1-4: DNA sequence with a cohesive end comprising a double stranded
DNA sequence from a gene linked to at least one single stranded DNA not adjoining the
double stranded DNA sequence in the gene.

Invention 2, claims 5-7 and 20-26, and related parts of claim 27: A method for producing a DNA
sequence with cohesive ends using ribonucleotides as a primer to create a double stranded
DNA with polymerase reaction, whereafter the ribonucleotides are removed to create a cohesive
end, and a method for producing a DNA pool by applying the method.

Invention 3, claims 8-19 and related parts of claim 27: A method for shuffling a DNA and a
DNA pool containing DNAs obtained by the method.

In spite of the non-unity all claims have been searched. 



Form PCT/ISA/210 (extra sheet) (July 1992)



01/10/97

Patent document          Publication   Patent family    Publication
cited in search report   date          member(s)        date

WO 9107506 A1            30/05/91      AU 6886991 A     13/06/91

WO 9517413 A1            29/06/95      DE 4343591 A     22/06/95

WO 9522625 A1            24/08/95      AU 2971495 A     04/09/95
                                       CA 2182393 A     24/08/95
                                       CN 1145641 A     19/03/97
                                       EP 0752008 A     08/01/97
                                       US 5605793 A     25/02/97



Form PCT/ISA/210 (patent family annex) (July 1992) 



